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(57) Abstract 

A protein having erythroid potentiating activity 
(EPA). The EPA protein has a biological activity of at 
least about 10 6 units per mg of protein and has an ap- 
parent molecular weight of about 28,000 daltons as de- 
termined by SDS-PAGE analysis on 10 percent polyac- 
rylamide gel. Also described is a cloned EPA gene cod- 
ing for such EPA protein, a recombinant vector con- 
taining the EPA gene, a microorganism or mammalian 
cell transformed with such a recombinant vector, and a 
method for producing EPA by expressing said EPA 
gene by culturing said microorganism or mammalian 
cell. Purified EPA stimulates the growth of both early 
and late erythroid precursors from human bone mar- 
row, as well as colony formation by the K562 erythrol- 
eukemia cell line. 
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VECTORS CONTAINING A GENE FOR A PROTEIN HAVING 
ERYTHROID- POTENTIATING ACTIVITY AND RECOMBINANT 
DMA MRTHOnS PQR PRODUCT Nft Sim ^^KTN 



Field q£ the Tnvftntlon 

Thia invention relates to the production of a protein having 
erythroid-potentiating activity (EPA) and particularly to a pure 
EPA protein, to vectors containing the gene for expressing said • 
protein, to microorganisms and mammalian cells transformed with 
said vectors, and to methods for producing said protein by 
recombinant DNA techniques. 

Background of the Tnvenfcian 

The many different cell types found in blood are all derived 
from pluripotent' hematopoietic stem cells, stem cells perform 
two functions! (1) they reproduce themselves, thereby 

~ maintaining a stem cell population in the body and (2) they 
provide progeny committed to differentiate into any of the mature 
blood cell types. A cell which is committed to differentiate 

. along a particular hematopoietic pathway is termed a progenitor 
cell. Progenitor cells for T lymphocytes, B lymphocytes, 
granulocytes, macrophages, red blood cells, platelets, and 
eosinophils, as well as earlier progenitors which can 
individually give rise to several of the mature cell types, have 
been studied experimentally both la vivo and ia vitro (Dexter, 
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T.M. 1983 J. Pathology Ml 4X5-433) . It has been determined in 
Vitro that proliferation and/or differentiation of each 
progenitor cell type depends upon "factors" which have been 
derived from various sources. For example, T lymphocytes require 
T-cell growth factor (IL-2) and some of the later progenitors of 
red blood cells require a factor called erythropoietin. 

Several general classed of erythrocyte (red blood cell) 
progenitors have been distinguished by the time course of colony 
formation in vitro and the characteristics of the colonies 
produced. Although there is a continuum with respect to the 
maturity of erythroid colony-forming cells, the most mature of 
these is generally referred to as colony-forming unit - erythroid 
(CPU-E) , giving rise to relatively small colonies consisting of 8 
to 64 heraoglobinized cells. The burst-forming unit (BFO-E) is a 
more primitive erythroid progenitor. The BFO-Es require more 
time to mature than CPU-Es and they can give rise to very large 
colonies containing up to thousands of cells with multiple 
subcolonies. The CPU-E is thought to arise from the BPO-E. The 
CFU-E is sensitive to erythropoietin CEPO) whereas the BPU-E is 
relatively insensitive to EPO. 

Several in iisa studies have suggested that erythropoietin 
does not affect the pluripotent hematopoietic stem cell or the 
early erythroid progenitor cell (BPO-E) . As a result of these 
studies, iscove postulated the existance of an independent 
modulator of multipotent stem cells and early committed erythroid 
cells. This hormone or group of hormones has been referred to as 
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burst-promoting activity (BPA) or erythroid-potentiating activity 
(EPA) because it ia primarily assayed by its ability to stimulate 
the growth of BPU-Es. Studies of BPA have generally employed 
unpurified material, derived from various sources. One murine 
protein IL-3, which affects all myeloid progenitors including 
cells of the erythroid .lineage as well as stem cells, has 
recently been purified to homogeneity and the cDNA for this 
protein has been cloned. 

Common sources for EPA are peripheral blood mononuclear 
cells, bone-marrow conditioned medium, urine from anemic 
patients, serum, and normal and neoplastic cells of T-lymphocyte 
and mononuclear phagocyte lineage. The most widely used source 
of EPA is lectin-stimulated lymphocytes. Medium conditioned by 
phytohemagglutinin (PHA-) , concanavalin-A- and pokeweed 
mitogen-stimulated lymphoid populations consistently produces 
EPA. Erythroid-potentiating activity is also released by 
lymphoid cells responding to antigen, it is most likely that in 
these circumstances the EPA is being elaborated by activated T 
lymphocytes. Consistent with this in vitro evidence for 
production of EPA by T lymphocytes are studies in which erythroid 
colony formation in vivo is greatly stimulated by addition of T 
lymphocytes. 

Conditioned medium from heterogeneous cell populations has 
proved useful for biologic studies of the action of EPA; however, 
these materials have not been highly suitable for biochemical 
investigations. For this reason, the search for cell lines 
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producing EPA has been actively pursued. Several cell lines are 
known to produce EPA. In general, human T-cell lines composed of 
mature helper T cells, which elaborate other lymphokines, produce 
EPA. Undifferentiated T-cell lines, which are widely used in 
studies of T-cell function, do not appear to produce EPA. Two 
human raonocyte-like cell lines, U-937 and GCT elaborate EPA. 

The characterization and partial purification of a human 
T-lyraphocyte-derived factor that stimulates erythroid progenitors 
la vitro was reported by Lusis and Golde in Proceedings of thg 
Second Conference on Hemoglobin switching. Q. Stammatoyannopoulos 
and A.W. Nienhuis, eds., Gume & stratton (1981). Their source of 
this factor (EPA) is a human T-lymphoblast cell line (Ho) that 
was derived from a patient with a T-cell variant of hairy cell 
leukemia. Medium conditioned by Ho cells in the absence of serum 
is a rich source of EPA, and using this source EPA was purified 
about 250-fold. The final product stimulated human CFU-E and 
bpu-e progenitors in vitor in nanogram quantities. Physically, 
EPA appears to be an acidic glycoprotein of molecular weight 
about 45,000, although it exhibits a degree of size and charge 
heterogeneity. EPA was separable from a granulocyte-macrophage 
colony-stimulating factor (G,H-CSP) also produced by Ho cells. 
However, the ■purified" EPA retained an activity that stimulates 
the formation of mixed myeloid colonies In vitro , and may be 
employed to stimulate the formation and growth of myeloic cells 
including granulocytes, macrophages/ megacaryocytes and mast cells • 

Because of its stimulating activity on erythroid 
progenitors/ EPA can be useful for the treatment of anemia 
associated with chronic renal failure and for adjunctive therapy 
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in patients receiving cancer or other chemotherapy. An adequate 
source of EPA will also be of great value in medical research to 
study the growth and formation of erythroid cells. EPA treatment 
can also be used for chronic diseases having associated anemia 
such as rheumatoid arthritis, tuberculosis, osteomyelitis, chronic 
urinary tract infections, and some malignancies. As in the 
anemia or chronic renal failure, EPA will be useful in treating 
the low red blood cell counts of patients with the above chronic 
diseases, however it will not treat the underlying basic disease 
process. 

Because the effect of EPA will be to reduce the numbers of 
transfusions required by patients having chronic renal failure, 
the likelihood of contracting transfusion related infections 
which include hepatitis A, hepatitis B, non A and non B 
hepatitis, infectious mononucleosis, CMV disease, toxoplasmosis, 
malaria, syphillis and trypanosomiasis will be reduced. 

Thus, it can be appreciated that it would be desirable to 
have a source for EPA that could readily supply this protein in 
quantities sufficient for the above uses. Recently developed 
techniques have made it possible to employ microorganisms or 
eukaryotic cells, capable of rapid and abundant growth, for the 
synthesis of commercially useful proteins and peptides, 
regardless of their source in nature. The general approach is to 
isolate (clone) the DNA sequence (gene) encoding the protein of 
interest, to insert that gene into a nucleic acid vector 
containing signals for abundant expression of the protein, and to 
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transfer the vector with inserted gene into a suitable host cell. 
This host-vector combination can be grown in volume to produce 
quantities of the protein, and the protein can then be recovered 
by known separation and purification techniques. 

One method of gene isolation is based upon knowing part or 
all of the amino acid sequence of the protein encoded by the 
gene. Each of the twenty commonly occurring amino acids is 
encoded by one or more triplets of nucleotides called codons (for 
discussion see J.D. Watson r Molecular Biology n f the n*n* - ( W . A . 
Benjamin, Inc., 1977, p. 347-377). Knowing the 
deoxyribonucleotide sequence of the gene (or DNA sequence which 
codes for a particular protein) allows the exact description of 
that protein's amino acid sequence. However, the converse is not 
true; for example, although tryptophan or methionine are coded 
for by only one codon, the other amino acids can be coded for by 
up to six codons. Thus, there is considerable ambiguity in 
predicting the nucleotide sequence from the amino acid sequence. 
However, a pool of synthetic oligonucleotides, corresponding to 
all possible combinations of codons for a particular sequence of 
amino acids within a protein, should include the nucleotide 
sequence which in fact occurs in the gene encoding that protein. 
Such synthetic oligonucleotide pools can be used to screen cloned 
gene libraries by standard techniques in order to identify the 
clone of interest. 

Determination of part or all of the amino acid sequence of a 
protein, as a basis for synthesis of oligonucleotide screening 
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probes, requires that the protein be purified to homogeneity, m 
the case of EPA, prior to the present invention, only a partial 
purification had been achieved which was not sufficient to obtain 
the required amino acid sequence to synthesize suitable screening 
probes. 

Summary of fch» Tn^nH^ 

The present invention overcomes these problems and provides 
a ready souce o£ EPA using recombinant DNA technology. Thus, the 
present invention provides a substantially pure EPA protein 
having a biological activity of at least about 10« units per mg 
of protein and having an apparent molecular weight of about 
28,000 daltons as determined by SDS-PAGE analysis on 10 percent" 
polyacrylamide gel, a cloned EPA gene coding for such EPA 
protein, a recombinant vector containing the EPA gene, a 
microorganism or mammalian cell transformed with such a 
recombinant vector, and a method for producing EPA by expressing 
said EPA gene by culturing said microorganism or mammalian cell. 

Purified EPA stimulates the growth of both early and late 
erythroid precursors from human bone marrow, as well as colony 
formation by the K562 erythroleukemia cell line. 

Brief nPflcrinHttn nf th» n^ nt|f1 

Pig. 1 is a schematic illustrating the preparation of 
plasmid pTPL from plasmid pAdD26SVpA<3) . 

Pig. 2 is a schematic continuing from Pig. l and 
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illustrating the preparation of plasmid p91023 from plasmid pTPL. 

Pig. 3 is a schematic continuing from Pig. 2 and 
illustrating plasmid p91023(B). 

Pig. 4 sets forth a nucleotide sequence containing an EPA 
gene cloned in accord with one embodiment of this invention along 
with the deduced amino. acid sequence of the EPA protein that the 
gene encodes. 

Fig. 5 sets forth another nucleotide sequence containing a 
sequence coding for EPA in accord with another embodiment of the 
invention along with the deduced amino acid sequence of the EPA 
protein encoded thereby. 

Detailed nefloMpUnn of fch» Tnt, fnf jft n 

The following definitions are supplied in order to 
facilitate the understanding of this case. To the extent that 
the definitions vary from meanings circulating within the art, 
the definitions below are to control. 

Amplification means the process by which cells produce gene 
repeats within their chromosomal DNA. 

Downstream means the direction going towards the 3' end of a 
nucleotide sequence. 

An enhancer is a nucleotide sequence that can potentiate the 
transcription of genes independent of the identity of the gene, 
the position of the sequence in relation to the gene, or the 
orientation of the sequence. 

A gene is a deoxyribonucleotide sequence coding for a given 
mature protein. Por the purposes herein, a gene shall not 
include untranslated flanking regions such as rna transcription 
initiation signals, polyadenylation addition sites, promoters or 
enhancers. 
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A selection gene is a gene that confers a phenotype on cells 
which express the gene as a detectable protein. 

A selection agent is a condition or substance that enables 
one to detect the expression of a selection gene. 

Phenotype means the observable properties of a cell as 
expressed by the cellular genotype. 

Genotype means the genetic information contained within a 
cell as opposed to its expression, which is observed as the 
phenotype. 

Ligation is the process of forming a phosphodi ester bond 
between the 5' and 3' ends of two DNA strands. This may be 
accomplished by several well known enzymatic techniques, 
including blunt end ligation by T4 ligase, • 

Orientation refers to the order of nucleotides in a DNA 
sequence. An inverted orientation of a DNA sequence is one in 
which the 5 • to 3 ' order of the sequence in relation to another 
sequence is reversed when compared to a point of reference in the 
DNA from which the sequence was obtained, such points of 
reference can include the direction of transcription of other 
specified DNA sequences in the source DNA or the origin of 
replication of replicable vectors containing the sequence. 

Transcription means the synthesis of rna from a DNA 
template. 

Transformation means changing a cell's genotype by the 
stable transfection of cells with exogenous DNA. Transformation 
may be detected in some cases by an alteration in cell phenotype. 
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Transfonned cells are called transf ormants. Pre- transformation 
cells are referred to as parental cells. 

Translation means the synthesis of a polypeptide from 
messenger RNA. 

Erythroid-potentiating activity (EPA) can be derived from a 
number of cellular sources including peripheral blood mononuclear 
cells, bone-marrow conditioned medium, urine from anemic 
patients, serum, and normal and neoplastic cells of T-lymphocyte 
and mononuclear phagocyte lineage. Because EPA has not been 
purified to homogeneity and because the assay for EPA is complex, 
the amino acid sequence for EPA is unknown. Thus, an 
oligonucleotide probe for EPA raRNA could not be predicted with 
any degree of certainty from any literature sources. 

However, there have been reports of partial purification of 
EPA. For instance, Lusis and Golde, supra, reported a 250-fold 
purification of EPA from the Mo cell line with the resulting 
product having a specific activity of about 51,000 units/mg 
protein and porter and Ogawa, Blood. 52 p . 12 07 (1982) have 
reported a 300-fold purification of EPA from human bone-marrow 
conditioned medium. Also Iscove at pp. 37-52 in Golde, et al., 
Eds " HeiBatOnolfM-ic Cell nifferent-^l-^n. TCM-nrr.i S vnmn R i a on 

Mo l ecular find CeJlnlnr Biology , vol. 10, Academic press, New York 
(1978) reported the purification of EPA from mouse spleen cells, 
Dubes, et al, at pp. 119-128 in Golde, et al., Eds., supra, and 
in E X P , ffftnif i tftl .. 2. (Supp. 8) p. 128 (1980) reported the 
separation of EPA from human urine, and recently Abboud, et al., 
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Blfifid, M PP. 1148 (1981) reported the use of hydrophobic 
adsorption chromatography to separate EPA from the 
colony-stimulating activity factor (CSA) . In general, the 
activities that have been characterized appear to be 
glycoproteins of molecular weight 30,000 to 45,000 and the 
reported heat stabilities have varied considerably. 

EPA was partially purified from Ho cell line conditioned 
medium in accord with the procedures described in Lusis and 

Golde ' Proceedings of the spcond n H » m ™i„K 1n 

Sw i tching , supra. The partially purified EPA would not permit an 
analysis sufficient to construct possible oligonucleotide probes 
for epa. Thus, further purification to substantial homogeneity 
was required. Such purification was accomplished in accord with 
the present invention by sequential concentration, ammonium 
sulfate precipitation, lentil-lectin affinity chromatography, gel 
filtration and reverse phase high performance liquid 
chromatography (HPLC) . EPA was assayed by its ability to 
stimulate the growth of large erythroid colonies (bursts) from 
normal human peripheral blood. The purified EPA in accord with 
the present invention has an apparent molecular weight of 28,000 
and appears as a single band when analyzed by SDS-PAGE under 
reducing or nonr educing conditions. 

The EPA protein was analyzed on a gas phase sequenator to 
obtain, a paartial amino acid sequence which was sufficient to 
predict possible DNA sequences and make oligonucleotide probes. 
All sixty-four possible 12-mers that could encode residues 9-13 
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of the analyzed amino acid sequence, all sixty-four possible 
14-raers that could encode residues 21-25 of the analyzed amino 
acid sequence, and two pools each of sixteen possible 53-mers 
(based on coding usage preferences) that could encode residues 
9-26 of the analyzed amino acid sequence were prepared. 

T-lymphocytes were induced with phytohemagglutinin (PHA) and 
phorbol myristic acid (PHA) to enhance their lymphokine 
production. The cells were centrif uged and cytoplasmic RNA was 
prepared by a standard gentle lysis procedure. Polyadenylated 
messenger RNA was then prepared by chromatography on oligo dT 
cellulose. 

First strand cDNA was prepared using standard methods using 
the raRNA prepared above. To enrich for sequences specific for 
lymphokines, the first strand cDNA was hybridized with an excess 
of raRNA f rom cells that do not make lymphokines. Most of the 
sequences expressed in the T-lymphocytes that produce lymphokines 
will also be expressed in, for instance, B-cell lines and 
immature T-cell lines. A small fraction of sequences will be 
found only in the starting T-lymphocyte cell line. Most of the 
single stranded cDNA will anneal to the raRNA from the B-cell 
lines or immature T-cell lines and become double stranded. The 
cDNA unique to the starting T-lymphocyte cell line remains single 
stranded. The single and double stranded DNA are separated by 
chromatography on hydroxy lapatite, thereby substantially 
concentrating the lymphokine sequences. 

The single stranded cDNA was then converted to 
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double-stranded form by treatment with the Klenow fragment of DNA 
polymerase I using standard techniques. The loopback form was 
cleaved with Sl-nuclease and C-tails were added using terminal 
transferase by standard methods. Finally, the C-tailed cDNA was 
annealed with G-tailed P BR322 and used to transform e. coli . The 
El CQ l * clones were spread onto agar plates and the colonies were 
lifted off with nitrocellulose filter disks. 

After growing these colonies, replicas were made by 
carefully placing a pre-wetted filter on top of the original 
filter. The replica filters were aligned with the original by 
punching 3 asymmetrical holes through the filters with a needle. 
The replica filters were removed to fresh plates and grown. The 
plasmid DNA in each colony was amplified in situ by transferring 
the filters to fresh plates containing 100 ug/ml chloramphenicol 
and incubating overnight. Finally, these replica filters were 
prepared for standard colony hybridization by treatment with 
base, followed by neutralization and drying. The dried filters 
were baked to fix the liberated DNA to the nitrocellulose. 

To identify EPA clones, one set of replica filters was 
probed with the pools of 12-mers and another corresponding set of 
duplicate filters was probed with the pools of 14-mers. 

Potential EPA clones were identified by aligning the 
duplicate filters and looking for individual colonies which 
hybridized to both independent probes (the 64 14-mers and 64 
12-mers). The clones which hybridized to both probes were picked 
by aligning the x-ray film of the hybridized filter with the 
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ociginal master fUter. These colonies were picked and grown in 
liquid culture for the preparation of analytical amounts of DNA 
(prepared by the standard base treatment method) . 

Samples of DNA from each of the picked clones were cleaved 
with Pstl to excise the inserts. These digests were analyzed by 
agarose gel electrophoresis. A variety of insert sizes was 
observed. The gel was next prepared for standard Southern 
analysis. Nitrocellulose strips were placed on either side of 
the gel. This sandwich was placed on top of a 1 inch stack of ' 
paper towels and a second stack of towels was placed on top of 
the gel. DNA was allowed to leach out of the gel overnight (in 
both directions) giving replica filters of the gel. These 
filters were baked and then probed overnight as described for the 
colony hybridization (1 filter with the 12-raers and 1 with 
14-mers). After washing the filters and exposing them to film it 
was observed that three clones (designated 27, 57, 58) hybridized 
most strongly to both probes. These three clones all clearly bad 
at least one internal Pstl site suggesting that they might encode 
the same sequence. To further test the clones, the Southern 
filters were washed to eliminate the radioactivity and were 
reprobed with the 2 pools of labeled 53-mers. Both sets of 
53-mers hybridized strongly only with the three clones designated 
27, 57, and 58 suggesting that these clones encoded the EPA 
sequence above. 

Each of the 8 pools of 14-mers was tested separately to 
determine which pool hybridized most strongly with the EPA 
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clones. Pool 14-5 proved to hybridize most strongly to clones 
27, 57, and 58 indicating that this pool of oligonucleotides 
would be a good primer for dideoxy sequencing of the candidate 
clones. 

DNA from each candidate clone was purified using cesium 
chloride equilibrium density centrif ugation and sequenced using 
standard dideoxy. sequencing with pool 14-5. as the primer. The 
complete nucleotide sequence was obtained by subcloning the Psti 
fragments of the clone (the EPA gene was found to have 3 internal 
Psti sites which generate four small fragments for sequencing) 
into M13 and sequencing them using the universal M13 primer. To 
align the sequences of the four Psti fragments, oligonucleotide 
primers were synthesized for use in sequencing the plasraid DNA 
directly (as described above with pool 14-5) . One of these 
primers 

d(T-G-C-A-C-C-T-G-T-G-T-C-C-C-A-C-C-) 
encoded the end of the mature protein and could be used for 
insertion into the bacterial expression vector as well as 
sequencing at the 5' end of the coding region. A second primer; 

d(G-A-G-G-A-G-T-T-T-C-T-C-A-T-T-G-C-T-G) 
encoded the region near the 3' end of the first Psti fragment and 
could be used to sequence through the three psti sites towards 
the 3» end of the clone. The final primer; 

d(C-C-A-C-A-A-G-C-A-A-T-G-A-G-G-T-G-C-C-) 
was complementary to the mWJA strand near the 5» end of the 3 • 
Pst fragment of the clone and could be used to sequence toward 
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the 5" end of the clone through the 3 Pstl sites. The complete 
DNA sequence of the EPA gene of clone 57 is illustrated in Pig. 4 
along with the amino acid sequence of the translated protein 
product. 

Once the full length EPA cDNA clone is obtained, known and 
appropriate means are utilized to express the EPA protein, e.g. 
insertion into an appropriate vector, transfection into an 
appropriate host, selection of transf ormants, and culturing these 
transformants to express EPA activity. 

Host- vector systems for the expression of EPA may be 
procaryotic, but the complexity of EPA may make the preferred 
expression system a mammalian one. This is easily accomplished 
by transformation or by microinjection of eucaryotic (usually 
mammalian or vertebrate) cells with a suitable EPA vector. 
Eucaryotic transformation is in general a well-known process, and 
may be accomplished by a variety of standard methods. These 
include the use of protoplast fusion, chromosome transfection, 
lytic and nonlytic viral vectors (For example, Mulligan et al., 
•Nature- (London) 277:108-114 U979J, cell-cell fusion (Pournier 
et al., "Proc. Nat. Acad. Sci." 21*319-323 11977), lipid 
structures (U.S. patent 4,394,448) and cellular endocytosis of 
DNA precipitates (Bachetti et al., -Proc. Nat. Acad. Sci. 
74:1590-1594 C1977J). 

Transformation which is mediated by lytic viral vectors is 
efficient but is disadvantageous for a number of reasons: the 
maximum size of transf ected DNA is limited by the geometry of 
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viral capsid packing, the exogenous genes are frequently deleted 
during viral replication, there is a requirement for helper virus 
or specialized hosts, host cells must be permissive, and the 
hosts are killed in the course of the viral infection. 

, Nonlytic transformations are based on the transcription and 
translation of virus vectors which have been incorporated into a 
cell line as a stable episome. These systems generally require 
unique cell lines and suffer from a number of disadvantages. See 
"Trends in Biochemical Sciences-, June 1983, pp. 209-212. 

On the other hand, other transformation methods in which 
extrachromosomal DNA is taken up into the chromosomes of host 
cells have been characterized by low frequencies of 
transformation and poor expression levels. These initial 
difficulties were ameliorated by transformation with genes which 
inheritably confer selectable phenotypes on the small 
subpopulation of cells that are in fact transformed (selection 
genes). The entire population of transformed cells can be grown 
under conditions favoring cells having acquired the phenotype, 
thus making it possible to locate transf ormants conveniently. 
Thereafter, transf ormants can be screened for the capability to 
more intensely express the phenotype. This is accomplished by 
changing a selection agent in such a way as to detect higher 
expression. 

Selection genes fall into three categories: Detectably 
amplified selection genes, dominant selection genes, and 
detectably amplified dominant selection genes. 
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Detectably amplified selection genes are those in which 
amplification can be detected by exposing host cells to changes 
in the selection agent. Detectably amplified genes which are not 
dominant acting generally require a parental cell line which is 
genotypically deficient in the selection gene. Examples include 
the genes for hydroxymetholglutanyl CoA reductase (Sinensky, 
"Biochem. Biophys. Res. Commun - 2&:863 11977], ribonucleotide 
reductase (Meutb et al. "Cell tli 367 £1943] , aspartate 
transcarbamylasej (Kemp et al. "Cell- 9.1541 [19761 , adenylate ' 
deaminase (DeBatisse et al. "Mol and Cell Biol.* 2(11) :1346-1353 
11982] mouse dihydrof olate- reductase (DHPR) and, with a defective 
promoter, mouse thymidine kinase (TK) . 

Dominant selection genes are those which are expressed in 
transformants regardless of the genotype of the parental cell. 
Most dominant selection genes are not detectably amplified 
because the phenotype is so highly effective in dealing with the 
selection agent that it is difficult to discriminate among cell 
lines that have or have not amplified the gene. Examples of 
dominant selection genes of this type include the genes for 
procaryotic enzymes such as xanthine-guanine 
phosphoribosyltransf erase (Mulligan et al. "Proc. Nat. Acad. 
Sci." 24(4] i 2072-2076 11981] and aminoglycoside 3« - 
phosphotransferase (Colbere-Garapin et al., "J. Mol. Biol.", 
1511:1-14 [1981]. 

Some dominant selection genes also are detectably amplified. 
Suitable examples include the mutant DHPR gene described by Haber 
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et al., "Somatic Cell Genet." 1:499-508 (1982), cell surface 

markers such as HLA antigens and genes coding for enzymes such as 

specific esterases that produce fluorescent or colored products 

from fluorogenic or chromogenic substrates as is known in the 
art. 

Detectably-amplif ied, dominant selection genes are preferred 
for use herein, it should be understood that a dominant 
selection gene in some cases can be converted to a detectably 
amplified gene by suitable mutations in the gene. 

Selection genes at first were of limited commercial utility. 
While they enabled one to select transf ormants having the 
propensity to amplify uptaken dna, most selection genes produced 
products of no commercial value. On the other hand, genes for 
products which were commercially valuable generally did not 
confer readily selectable (or even detectable) phenotypes on 
their transf ormants. This would be the case, for example, .with 
enzymes or hormones which do not provide transformed cells with 
unique nutrient metabolic or detoxification capabilities. Most 
proteins of commercial interest fall into this group, e.g. 
hormones, proteins participating in blood coagulation and 
fibrinolytic enzymes. 

Thus, vectors used in EPA transformation can contain a 
selection gene and the EPA gene, m addition, there usually will 
be present in the transformation vectors other elements such as 
enhancers, promoters, introns, accessory DMA, polyadenylation 
sites and 3' noncoding regions as will be described below. 
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Suitable selection genes are described above. It is 
preferred that the selection agent be one that prevents cell 
growth in the absence of the selection gene. That way, revertant 
cells in large scale culture that lose the selection gene (and 
presumably the EPA gene as well) will not over-grow the 
fermentation. However,, it would be desirable in the commercial 
production of EPA to avoid the use of cell toxins, thereby 
simplifying the product purification steps. Thus, a desirable 
selection gene would be one that enables transf ormants to use a 
nutrient critical for growth that they otherwise would not be 
able to use. The TK gene described above is an example. 

Two classes of vectors can be employed in transformation. 
The first class are the unlinked vectors. - Here the selection 
gene and the EPA gene are not covalently bound. This vector 
class is sometimes preferred because the step of ligating or 
otherwise bonding the two genes is not required. This simplifies 
the transformation process because the selection and product 
genes usually are obtained from separate sources and are not 
ligated in their wild-type environment, in addition, the molar 
ratio of the EPA and selection genes employed during 
transformation can be adjusted to increase transformation 
efficiency. 

The second class of vectors is linked vectors. These 
vectors are distinguished from unlinked vectors in that the 
selection and EPA genes are covalently bound, preferably by 
ligation. 
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The vectors herein nay also include enhancers. Enhancers 
are functionally distinct from promoters, but appear to operate 
in concert with promoters. Their function on the cellular level 
is not well understood, but their unique characteristic is the 
ability to activate or potentiate transcription without being 
position or orientation dependent. Promoters need to be upstream 
of the gene, while enhancers may be present upstream or 5' from 
the promoter, within the gene as an intron, or downstream from 
the gene between the gene and a polyadenylation site or 3' from 
the polyadenylation site, inverted promoters are not functional, 
but inverted enhancers are. Enhancers are cis-acting, i.e., they 
have an effect on promoters only if they are present on the same 
DNA strand. For a general discussion of enhancers see Khoury et 
al., "Cell" 22:313-314 (1983). 

Preferred enhancers are obtained from animal viruses such as 
simian virus 40, polyoma virus, bovine papilloma virus, 
retrovirus or adenovirus. Ideally, the enhancer should be from a 
virus for which the host cell is permissive, i.e. which normally 
infects cells of the host type, viral enhancers may J* obtained 
readily from publically available viruses. The enhancer regions 
for several viruses, e.g., Rous sarcoma virus and simian virus 
40, are well known. See Luciw et al., -Cell- 11:705-716 (1983). 
It would be a matter of routine molecular biology to excise these 
regions on the basis of published restriction maps for the virus 
in question and, if necessary, modify the sites to enable 
splicing the enhancer into the vector as desired. Por example, 
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see Kaufman et al, "J. Hoi. Biol.", 601-621 (1982) and "Hoi. 
Cell Biol." 2(11) :13Q4-1319 (1982). Alternatively, the enhancer 
may be synthesized from sequence data; the sizes of viral 
enhancers (generally less than about 150 bp) are sufficiently 
small that this could be accomplished practically. 

Another element which should be present in the vector 
assembly is a polyadenylation splicing (or addition) site. This 
is a DNA sequence located downstream from the translated regions 
of a gene, shortly downstream from which in turn transcription 
stops and adenine ribonucleotides are added, to form a polyadenine 
nucleotide tail at the 3" end of the messenger UNA. 
Polyadenylation is important in stabilizing the messenger RNA 
against degradation in the cell, an event that reduces the level 
of messenger RNA and hence the level of product protein. 

Eucaryotic polyadenylation sites are well known. A 
concensus sequence exists among eucaryotic genes: The 
hexanucleotide 5 *— AADAAA-3 • is found 11-30 nucleotides from the 
point at which polyadenylation starts. DNA sequences containing 
polyadenylation sites may be obtained from viruses in accord with 
published reports. Exemplary polyadenylation sequences can be 
obtained from mouse beta-globin, and simian virus 40 late or 
early region genes, but viral polyadenylation sites are 
preferred, since these sequences are known, they may be 
synthesized la vitfQ and ligated to the vectors in conventional 
fashion. 

A polyadenylation region must be located downstream from the 



WO 86/02100 W W PCT/US85/01900 

-23- 

EPA gene. The sequence which separatee the polyadenylation site 
from the translational stop codon is preferably an untranslated 
dna oligonucleotide such as an unpromoted eucaryotic gene. Since 
such oligonucleotides and genes are not endowed with a promoter 
they will not be expressed. The oligonucleotide should extend 
for a considerable distance, on the order of up to about 1,000 
bases, from the stop codon to the polyadenylation site. This 3« 
untranslated oligonucleotide generally results in an increase in 
product yields. The vector may terminate from about 10 to about 
30 bp downstream from the concensus sequence for polyadenylation, 
but it is preferable to retain the 3' sequences found downstream 
from this site in its wild-type environment. These sequences 
typically extend about from 200 to 600 base pairs downstream from 
the polyadenylation site. 

The vectors described herein may be synthesized by 
techniques well known to those skilled in this art. The 
components of the vectors such selection genes, enhancers, 
promoters, and the like may be obtained from natural sources or 
synthesized as described above. Basically, if the components are 
found in DNA available in large quantity, e.g. components such as 
viral functions, or if they may be synthesized, e.g. 
polyadenylation sites, then with appropriate use of restriction 
enzymes large quantities of vector may be obtained by simply 
culturing the source organism, digesting its DNA with an 
appropriate endonuclease, separating the DNA fragments, 
identifying the DNA containing the element of interest and 
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recovering same. Ordinarily, a transformation vector will be 
assembled in small quantity and then ligated to a suitable 
autonomously replicating synthesis vector such as a procaryotic 
plasmid or phage. The pBR322 plasmid may be used in most cases. 
See Kaufman et al., £fi. cit . 

The synthesis vectors are used to clone the ligated 
transformation vectors in conventional fashion, e.g. by 
transfection of a permissive procaryotic organism, replication of 
the synthesis vector to high copy number and recovery of the 
synthesis vector by cell lysis and separation of the synthesis 
vector from cell debris. 

The resulting harvest of synthesis vector may be directly 
transf ected into eucaryotic cells, or the transformation vector 
may be rescued from the synthesis vector by appropriate 
endonuclease digestion, separation by molecular weight and 
recovery of the transformation vector. Transformation vector 
rescue is not necessary so long as the remainder of the synthesis 
vector does not adversely affect eucaryotic gene amplification, 
transcription or translation. For example, the preferred 
synthesis vector herein is a mutant of the e. con plasmid pBR322 
in which sequences have been deleted that are deleterious to 
eucaryotic cells. See Kaufman et al., ££. cit . use of this 
mutant obviates any need to delete the plasmid residue prior to 
transformation. 

The cells to. be transformed can be any prokaryotic cell or 
any eucaryotic cell, including yeast protoplasts and insect 
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cells, but ordinarily will be a nonfungal cell. Candidate cells 
need not be genotypically deficient in the selection gene so long 
as the selection gene is dominant acting. 

Eucaryotic cells preferably will be stable mammalian cell 
lines as is discussed above. Cell lines that are known to stably 
integrate selection genes into their chromosomal DNA are best, 
for example Chinese hamster ovary (CHO) cell lines. Also useable 
are HeLa, COS monkey cells, melanoma cell lines such as the Bowes 
cell line, mouse L cells, mouse fibroblasts and mouse NIH 3T3 
cells. 

Methods for facilitating cellular uptake of DNA are 
described above. Microinjection of the vector into the cell 
nucleus will yield the highest DNA uptake efficiencies, but 
exposing parental cells to DNA in the form of a calcium phosphate 
precipitate is generally most convenient. 

The population of cells that has been exposed to 
transforming conditions is then processed to identify the 
transf ormants. Only a small subpopulation of any culture which 
has been treated for transformation will exhibit the phenotype of 
the selection gene. The cells in the culture are screened for 
the phenotype. This can be accomplished by assaying the cells 
individually with a cell sorting device where the phenotype is 
one that will produce a signal, e.g. fluorescence upon cleavage 
of a fluorogenic substrate by an enzyme produced by the selection 
gene. Preferably, however, the phenotype enables only 
transf ormants to grow or survive in specialized growth media as 
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is further discussed above. 

Selection transf ormants then will be screened for ligation 
of the product gene into their chromosomes or for expression of 
the product itself. The former can be accomplished using 
Southern blot analysis, the latter by standard immunological or 
enzymatic assays. 

.Once the tranf ormants have been identified, steps are taken 
to amplify expression of the product gene by further cloning in 
the presence of a selection agent such as methotrexate (MTX) • 

The DNA sequence coding for a protein exhibiting EPA 
activity in accord with the present invention, such as 
illustrated in Pig. 4, can be modified by conventional techniques 
to produce variations in the final EPA protein which still have 
EPA activity in the assay tests described herein. Thus, for 
example, one, two, three, four or five amino acids can be 
replaced by other amino acids. Belgian Patent No. 898,016, which 
is incorporated herein by reference, describes one such typical 
technique for replacing cysteine by e.g., serine. 

EPA/cDNA in accord with this invention includes the mature 
EPA/cDNA gene preceded by an ATG codon and EPA/cDNA coding for 
allelic variations of EPA protein. One allele is illustrated in 
Pig. 4. The EPA protein of this invention includes the 
1-methionine derivative of EPA protein (Met- EPA) and allelic 
variations of EPA protein. The mature EPA protein illustrated by 
the sequence in Pig. 4 begins with the sequence 
Ala.Pro.phe.Glu... the beginning of which is depicted by a arrow 
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after nucleotide number 44 in Pig. 4. The Met-EPA would 
begin with the sequence Met .Ala.Pro.Phe.Glu. . . EPA 
protein of the present invention exhibits a specific 
activity of at least 10 6 units/mg of protein. 

As aforesaid, the EPA protein in accord with the 
present invention can be used for treatment of patients 
having low red blood cell counts to stimulate growth 
and formation of erythroid cells. For such use, a daily 
dosage of about 200 to 1000 ug per patient is typically 
indicated. The EPA protein is preferably injected into 
the patient intravenously in a suitable pharmacological 
carrier. Examples of such carriers include pharmaco- 
logical saline and human serum albumin in saline. 

Hence/ embodiments of the invention and subject 
matter provided thereby include/ without limitation: 
1) a method for preparing an expression vector contain- 
ing an EPA gene or DNA sequence coding for EPA/ said 
method being as generally outlined and specifically 
described herein; 2) cells transformed with an expres- 
sion vector prepared according to the method indicated 
in 1)/ above, including 2-A) prokaryotic cells, 2-B) 
eukaryotic cells and 2-C) mammalian cells; 3) cloned 
DNA sequences which code for EPA including the sequences 
3-A) set forth in Pig. 4 and 3-B) set forth in Pig. 5); 
4) Recombinant expression vectors comprising a DNA 
sequence which codes for EPA protein; 5) transformed 
cells comprising an expression vector as indicated in 
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4) i above/ including 5-A) prokaryotic cells/ 5-B) 
eukaryotic cells/ 5-c) yeast cells and 5-D) mammalian 
cells; 6) EPA protein made by expressing a recombinant 
expression vector containing an EPA gene or DNA sequence 
coding therefor in a transformed cell including 6-A) 
prokaryotic cells / 6-B) eukaryotic cells/ 6-C) yeast 
cells and 6-D) mammalian cells; 7) Human EPA protein 
substantially free of other protein of human origin; 
8) Gibbon ape EPA protein substantially free of other 
protein of Gibbon origin; 9) EPA protein substantially 
free of glycosylation; 10) EPA protein glycosylated by 
expression of a recombinant expression vector containing 
an EPA gene or DNA sequence coding therefor in a trans- 
formed eukaryotic cell; 11) EPA protein having an 
activity of at least about 10 6 units per milligram of 
protein including 11-A) protein of human origin/ 11-B) 
protein of Gibbon ape origin/ 11-C) EPA protein having 
the amino acid sequence set forth in Fig. 4 or an allelic 
variation thereof and 11-D) EPA protein having the amino 
acid sequence set forth in Pig. 5 or an allelic variation 
thereof; 12) cDNA which codes for EPA including 12-A) 
cDNA having the nucleotide sequence illustrated in Fig. 
4 and 12-B) cDNA having the nucleotide sequence illus-. 
trated in Fig. 5; 13) expression vectors comprising 
cDNA coding for EPA including 13-A) those comprising 
the nucleotide sequence illustrated in Fig. 4 and 13-B) 
those comprising the nucleotide sequence illustrated 
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in Pig. 5; 14) cells transformed with an expression 
vector as indicated in 13, 13-A and 13-B, above, 
including in each case 14-A) prokaryotic cells, 14-B) 
eukaryotic cells', 14-C) yeast cells, 14-D) mammalian 
cells and 14-E) insect cells; 15) EPA protein made by 
expressing cDNA coding for EPA in a transformed cell 

J 

including 15-A) prokaryotic cells, 15-B) eukaryotic 
cells, 15-C) yeast cells, 15-D) mammalian cells and 
15-E) insect cells; 16) EPA protein substantially free 
of other protein of native origin; 17) EPA protein 
glycosylated by expression of cDNA coding for EPA in a 
transformed eukaryotic cell including 17-A) mammalian 
cell and 17-B) insect cells; 18) EPA protein made by 
expressing in a cell cDNA having the nucleotide sequence 
illustrated in Pig. 4 and Pig. 5 including expression 
in each case in 18-A) prokaryotic cells, 18-B) eukaryo- 
tic cells, 18-C) yeast cells, 18-D) mammalian cells and 
18-E) insect cells; 19) purified EPA protein conforming 
to the amino acid sequence Met-EPA including 19-A) such 
protein having an activity of at least about 10 6 units, 
per milligram of protein; 20) purified EPA protein which 
is a mixture of EPA and Met-EPA including 20-A) such 
protein having an activity of at least about 10 6 units 
per milligram of protein; 21) therapeutic compositions 
for the treatment of mammals, particularly primates, to 
stimulate the formation and growth of erythroid cells 
comprising erythroid cell formation and growth stimulating 



WO 86/02100 W W PCT/US85/01900 

-30- 

effective amount of EPA protein in a pharraaceutically 
acceptable carrier or diluent; 22) methods for the 
treatment of mammals having low red blood cell counts 
comprising administering to said mammal an effective 
amount of EPA protein including 22-A) administering 
said ef f ective .amount intravenously such as by injection 
in the form of a therapeutic composition comprising 
EPA protein in a pharmaceutical^ acceptable carrier; 
23) methods for treating anemia associated with chronic 
renal failure comprising administering an effective 
amount of EPA protein; and 24) method for treating as 
adjunctive therapy the adverse effects including reduced 
red blood cell counts or anemia of chemotherapy including 
24-A) cancer chemotherapy/ in patients receiving chemo- 
therapy causing such adverse effects. 

The invention will be further understood with 
reference to the following illustrative embodiments/ 
which are purely exemplary/ and should not' be taken as 
limitive of the true scope of the present invention, as 
described in the claims. 

Unless otherwise noted/ restriction endonucleases 
are utilized under the conditions and in the manner 
recommended by their commercial suppliers. Ligation 
reactions are carried on as described by Maniatis/ T. 
et al./ Molecular Cloning - A Laboratory Manual Cold 
Spring Harbor Laboratory/ Cold Spring Harbor/ NY 1982 
p. 245-6, the disclosure of which is incorporated herein 
by reference/ using the buffer described at page 246 
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thereof and using a DNA concentration of 1-100 ug/ml, at 
a temperature of 23°C for blunt ended DNA and 16°C for 
"sticky ended" DNA. "Phosphataaing" as described herein, 
refers to dephosphorylation of DNA, and is carried out 
in the manner described by Maniatis et al., supra , e.g. 
at page 133 et seq. "Kinasing" refers to phosphorylation 
of DNA. Electrophoresis is done in 0.5-1.5% Agarose gels 
containing 90 mM Tris-borate, 10 mM EDTA. "Nick-transla- 
ting" refers to the method for labeling DNA with 32p as 
described by Rigby et al., J. Mol. Biol. , 113:237 (1977). 
All radiolabeled DNA is labeled with 32p, whatever 
labelling technique was used. 

By "rapid prep" is meant a rapid, small scale pro- 
duction of bacteriophage or plasmic DNA, e.g., as des- 
cribed by Maniatis et al., supra , at p. 365-373. 

In the examples, unless otherwise specified, tem- 
peratures are in °C. 

EXAMPLE 1 ' 
Step A. Culture Of The Mo Cells 

To obtain large volumes of Mo cells, cells were grown 
in T flasks containing about 200 ml of culture medium. 
Lots of 10 to 20 liters were prepared. A variant of the 
original Mo T-cell line was used to prepare the condi- 
tioned medium because these cells are able to grow in the 
absence of bovine serum (Chen, I.S., et al 1983 Nature 
305: 502-505). Cells were grown for expansion at 37°C in 
Iscove's modified Dulbecco's medium (Irvine Scientific) 
supplemented with 0.1 percent bovine serum albumin (Irvine 
Scientific, selected lots), 0.035 percent dextrose, 2.2 uM 
human transferrin (Sigma) and antibiotics, to a final density 
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of about 10 6 per ml. The cells were centrifuged, and transferred 
to twice the original volume in a medium consisting only of 
Iscove's modified Dulbecco's medium containing antibiotics. They 
were incubated for an additional week, and then removed by 
centrif ugation. After collection the medium was heated to 56°C 
for 30 rain to inactivate the HTLV-II. 



Step B. Biological &g H *y fl p or EPA A ff »H v ^y 

One method used to assay EPA is a modification of the 
previously described BPU assay. See Hunt, et al, J. Biol. chAm. 

pp. 7042-5 (1981) . The peripheral blood leukocyte fraction 
(buffy coat) obtained from healthy donors was plated in a mixture 
consisting of 0.8 percent methylcellulose (Dow Chemical Co.,), 
3.0 percent fetal bovine serum (Irvine Scientific and Hyclone 
selected lots), 0.5 0/ml of human urinary erythropoietin 
(supplied by the National Heart, Lung and Blood Institute about 
1,140 u/mg protein), 1Q~ 6 M Pecl 3 , 4.4 uM human transferrin, 1000 
U/ml penicillin and 1000 ug/ml streptomycin, 10~ 4 M 
alpha-thioglycerol, 10~7 M selenium, 0.035 percent dextrose and 
Iscove's medium. The assays were done in 96-well tissue culture 
plates (Costar); each well contained 0.1 ml of methylcellulose 
mixture with cells at a density of 4 to 5 x 105 per ml, and 0.01 
ml of test material. The material to be tested was dissolved in 
Dulbecco's phosphate-buffered saline (PBS; Irvine Scientific) 
containing 0.01 percent bovine serum albumin and sterile 
filtered. Two-fold serial dilutions were made; each dilution was 
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assayed in duplicate. Erythroid bursts were scored at 7 to 10 
days using an inverted microscope. These bursts contain a 
minimum of 50 cells and are reddish brown in color indicating 
hemoglobinization of cells. Typically, control wells contained 
approximately 30 erythroid bursts per well. The last dilution 
which gave stimulation greater than 30 percent above control was 
considered to have 1 unit/ml of EPA. 

BFU-E were also assayed from normal human bone marrow by a 
modification of a previously described technique, which employs' 
microtiter dishes (Hunt, et al, supra); light density bone marrow 
cells were plated at 3 x 10* cells/ml in Iscove's medium 
containing 10 percent fetal bovine serum, 1 u/ml of human 
erythropoietin, 0.8 percent methylcellulose. and antibiotics. 
Human bone marrow CFU-E were enumerated at 7 days from plates 
containing 2 x 10* bone marrow cells/plate (Hunt, R. et al 1981 
J . B i ol , rhfffl . 256: 7042-7045). Colony formation by K562 
erythroleukemia cells was analyzed as described by Gauwerky, C. 
E. , et al 1982 Blood 59x 300-305. 

Step C. Pur if i nalH on Df FP& 

All steps except HPLC were carried out at 4°C. After lentil 
lectin chromatography, manipulations were performed in 
polypropylene tubes or siliconized glassware to minimize loss by 
adsorption. Protein concentration was measured by the Bio-Rad 
method (Bio-Rad Laboratories). 

Phenylmethylsulfonyl fluoride was added to 10 liters of 
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conditioned medium at a final concentration of 1 mH to inhibit 
proteolysis. The medium was then concentrated 15-fold with an 
Amicon hollow-fiber apparatus, using a Type H1P10-8 filter. The 
concentrate was brought to 80 percent saturation by the addition 
of solid ammonium sulfate. The precipitate was collected after 
stirring on ice for 4 to 12 hours, dialyzed against PBS, and 
clarified by centrif ugation. This material was applied to a 
column (2.6 cm x 6 cm) of lentil lectin Sepharose 4B (Pharmacia) 
that had been equilibrated with PBS. The column was then washed 
with two column volumes of buffer, and EPA was eluted with 15 ml 
of 0.5 M alpha-raethyl-D-mannoside in PBS. The eluate was 
collected and concentrated in an amicon stirred ultrafiltration 
cell, using a FH10 membrane, to a volume of ' 6 ml. The material 
was then applied to an Ultrogel ACA 44 column (1.6 x 80 cm; LKB) 
equilibrated with PBS as the running buffer. The flow rate was 6 
ml per hour, and fractions of 2.5 ml were collected. Fractions 
with the highest specific activity were combined and applied 
directly to the HPLC column without additional manipulation. 
Reverse phase high performance liquid chromatography, HPLC, was 
performed on a Beckman apparatus, using a Vydac C-18 (10 x 250 
ram) column with a starting solvent of 10 percent acetonitrile and 
0.1 percent trifluoroacetic acid in water. The column was eluted 
with increasing concentrations of acetonitrile in trifluoroacetic 
acid using the following gradient: 10 percent to 30 percent 
acetonitrile over 10 min. then 30 percent to 60 percent over 60 
min. The flow rate was 1.5 ml per rain. Ultraviolet absorbance 
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was monitored at 220 nra, and protein peaks were collected 
manually, evaporated to dryness, and resuspended in a small 
volume of PBS. 

Sodium dodecyl sulfate polyacrylamide gel electrophoresis 
(SDS-PAGE) , was performed according to the method described by 
Laemmli, Nature 221, pp. 680-5 (1970) . Proteins were visualized 
by iodination prior to electrophoresis. Iodinations were 
performed using a modification of the chloramine I method 
described by Hunter, et al, Nature 1M, pp. 495-6 (1962), after 
reaction with 125! and chloramine T the samples were not desalted 
by gel filtration but were applied directly to the gel. The 
tracking dye was electrophoresed to the end of the gel and excess 
125x migrated as the dye front. This procedure obviated the need 
for the addition of carrier protein and allowed complete recovery 
of small amounts of protein. The gels were fixed in 50 percent 
methanol, lo percent acetic acid, soaked in 2 percent glycerol, 
dried, and exposed to film for 1 to 12 hours. 

EPA from the earlier purification steps (ammonium sulfate 
precipitation and lentil lectin eluate) was inhibitory at 
moderate concentrations. Because of these considerations it was 
necessary to assay EPA over a wide concentration range for 
accurate quantitation. The addition of 0.01 percent bovine serum 
albumin to the dilutions was necessary to prevent loss by 
nonspecific adsorption, especially during sterile filtration. 
Bovine serum albumin has a slight stimulatory effect in the 
assay, however, this effect was negligible at the concentration 
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(0.01 percent) used. 

It was difficult to accurately measure the amount of EPA in 
crude material and during the early steps of purification because 
of the presence of inhibitors of BPO-E growth. Although 
phenylmethylsulfonyl fluoride was routinely added to inhibit 
proteolysis, it may be unnecessary because the yields did not 
appear lower when it was omitted. 

The overall purification resulted in an apparent yield of 25 
percent; however, this figure is likely an overestimation due to 
inhibitors present in the crude material. Generally, 
approximately 25 ug is recovered from 10 liters of serum-free 
medium. The final product had a specif ic activity of 10^ o/mg 
protein indicating that it is biologically active in the 
picomolar range. EPA retains activity over several months when 
stored at 4°c. 

Highly purified "Sj-iabelled EPA was subjected to SDS-PAGE 
analysis on 10 percent polyacrylamide gels. The protein migrates 
as a single broad protein band (characteristic of many 
glycoproteins) at a molecular weight of 28,000 in the presence or 
absence of 10 percent beta-mercaptoethanol, suggesting that it 
exists as a single polypeptide chain. A single band was observed 
with either silver staining or autoradiography of 125i-labelled 
material. Attempts to elute the activity from SDS-PAGE gels by 
the method of Burgess et al, J. b!«i. rh* m 22Z, pp. 1998-2003 
(1977) yielded ambiguous results. Although activity appeared to 
coincide with the main protein fraction, the assays were 
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difficult to interpret because of the extreme variability of 
background activity due to the presence of BFU-E inhibitors in 
the gel eluates. when highly purified biologically active epa 
was analyzed again by reverse phase HPLC, a single protein peak 
was observed. Finally, approximately 10 ug of this purified 
material was analyzed in an automated microsequenator and 11 
cycles revealed single amino acids, confirming the presence of 
homogeneously purified protein. 

Step D. Svnthexia Of Prr>h fR 

Two preparations of EPA containing about 25 ug of protein 
were analyzed on a gas phase sequenator. Prom analyses of four 
separate samples, we deduced a partial EPA -amino acid sequence as 
follows t 

1 5 10 15 20 25 

(A)TCCPPH? QTAPCN SDLVIRAKPVGT 
Prom this sequence three different oligonucleotide probes were 
prepared. 

The first probe consisted of all sixty-four possible 12-mers 
that encode residues 9-13 of the above sequence (i.e. Q T A P) . 
These oligonucleotides were synthesized' in 8 pools each 
containing 8 sequences that would be complementary to the EPA 
mRNA sequence as follows: 



12-1 d (A-A-A-A-G-C-A-G-T-T-T-G) 
G G G 
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12-2 d (A-A-A-T-G-C-A-G-T-T-T-G-) 
G G G 

12-3 d (A-A-A-A-G-OT-G-T-T-T-G) ■ 
G G C 

12-4 d (A-A-A-T-G-C-T-G-T-T-T-G) 
G C C 

12-5 d (A-A-A-A-G-C-A-G-T-C-T-G) 
G G G . 

12-6 d (A-A-A-T-G-C-A-G-T-C-T-G) 
G C G 

12-7 d (A-A-A-A-G-C-T-G-T-C-T-G) 
G G C 

12-8 d(A-A-A-T-G-C-T-G-T-C-T-G) 
G C C 

The second probe consisted of all sixty-four 14-mers predicted 
from the sequence of residues 21-25 (AKFVG) . Again, these 
compounds were synthesized in 8 pools of 8 sequences 
complementary to the raRNA sequences 



14-1 d (C-C-T-A-C-A-A-A-T-T-T-T-T-G-C) 
C G C 

14-2 d (C-C-A-A-C-A-A-A-T-T-T-T-G-C) 
G G C 

14-3 d (C-C-T-A-C-A-A-A-T-T-T-C-G-C) 
C G C 

14-4 d (C-C-A-A-C-A-A-A-T-T-T-C-G-C) 
G G C 

14-5 d (C-C-T-A-C-A-A-A-T-T-T-G-G-C) 
C G C 

14-6 d (C-C-A-A-C-A-A-A-T-T-T-G-G-C) 
G G C 

14-7 d (C-C-T-A-C-A-A-A-T-T-T-A-G-C) 
C G C 
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14-8 d (C-C-A-A-O-A-A-A-T-T-T-A-G-C) 
G G C 

For the third probe, two pools of sixteen 53-raers were 
synthesized in which many of the possible sequences were 
eliminated by making guesses based on codon usage preferences 
(Mechanism of Ag i ng and Developing 118 (4) : 285-314 (1982)). The 
53-mers covered the region from residue 9 to residue 26. One 
pool (53-1) was synthesized to be complementary to the mRNA, 
while the other (53-2) was synthesized to have the same sense aa 
the mRNA. The sequences were as follows: 

53-1 d (G-T-G-C-C-C-A-C-A-A-C-T-T-A-G-C-C-C-G-G-A-T-G-A-C-C-A-G- 

G T C 

G-T-C-A-G-A-G-T-T-A-C-A-G-A-A-G-G-C-A-G-T-C-T-G) 

G 

53-2 d (C-A-G-A-C-T-G-C-C-T-T-C-T-G-T-A-A-C-A-G-T-G-A-T-C-T-G-G- 



T-G-A-T-C-C-G-G-G-C-T-A-A-G-T-T-T-G-T-G-G-G-C-A-C) 
AC 

Step E. Preparation Of rnwA T.ihrary 

Mo cells were induced for 24 hr with PHA and PMA to enhance 
their lympbokine production (2 x 10* cells at 10* cells/ml in 
RPMI 1640 + 5% PCS) . The cells were collected by centrif ugation 
and cytoplasmic RNA was prepared by a standard gentle lysis 
procedure. Polyadenylated messenger RNA was prepared by 
chromatography on oligo dT cellulose. 

First strand cDNA was prepared using standard methods 
beginning with lOug of mRNA. The mRNA was removed by base 



C 



C 
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treatment and the cDNA was purified by gel exclusion 
chromatography (Sepbarose CL-4B) and concentrated by ethanol 
precipitation. The yield was approximately 1 ug. 

To try to enrich for sequences specific for Mo cells, the 
1st strand cDNA was hybridized in lOul with an excess of mRNA 
isolated from cells which do not make lymphokines. The cell 
lines used were Daudi (a B-cell line) and CEM (an immature T-cell 
line) . Most of the sequences expressed in Mo, CEM and Daudi will 
be the same and only a small fraction will be found only in Mo.' 
Therefore, by hybridizing 10 ug of CEM mRNA and 10 ug of Daudi 
mRNA to 1 ug of Mo cDNA we expected that most of the cDNA would 
anneal to its complementary mRNA and become double-stranded. The 
Mo lymphokine cDNAs should remain single-standed. Since double 
and single stranded molecules can be resolved by chromatography 
on hydroxy lapatite, a substantial enrichment for lymphokine 
sequences should be achieved. After 18-24 hrs at 65°, the 
RNAxcDNA hybridization was passed over a 1 ml hydroxy lapatite 
column. The flow through of the column was collected (the 
single-strand fraction) and used to make a cDNA library. Roughly 
5% of the input cDNA was recovered. More than 90% was retained 
on the column (double-stranded) suggested that a 10-20 fold 
enrichment had been achieved. 

The single-stranded cDNA was concentrated by butanol 
extraction, passed over Sepharose C1-4B to remove phosphate 
buffer and collected by ethanol precipitation. The cDNA was then 
converted to a double-stranded form by treatment with the Klenow 
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f ragment of DNA polymerase I using standard methods. The 
•loopback- form was cleaved with Sl-nuclease and "C-tails- added 
with terminal transferase by standard methods. Finally, the 
tailed cDNA was annealed with G-tailed pBR322 (at the Pstl site) 
and used to transform E. coll. Prom a final yield of 30ng of 
double stranded cDNA,. approximately 40,000 clones were obtained 
which were spread onto 75 10 cm agar plates. The colonies were 
lifted off of the plates by placing pre-wetted nitrocellulose 
filter disks on top of them and removing the filters (colonies ' 
facing up) to fresh filters. After growing these colonies for 1 
hour at 37° , 2 replicas were made by carefully placing a 
pre-wetted filter on top of the original filter. The replica 
filters were aligned with the original by punching 3 asymmetrical 
holes through the filters with a needle. The replica filters 
were removed to fresh plates and grown for several hours at 37«. 
The plasmid DNA in each colony was amplified in situ by 
tansferring the filters to fresh plates and incubating overnight 
at 37°. Finally, these replica filters were prepared for 
standard colony hybridization by treatment with base followed by 
neutralization and drying. The dried filters were baked in Mflm 
for 2 hours at 80* to fix the liberated DNA to the 
nitrocellulose. 

Step f. Colony HvhrldlaaUnn 

To identify EPA clones, one set of replica filters was 
probed with the set of 64 12-mers and the other corresponding set 
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of duplicate filters was probed with the 64 14-roers. The 
hybridizations were performed at room temperature in 4x SSC, 5x 
Denhardt's, 100 ug/ml Herring Sperm DNA and 10 6 ug/ml of 
32 P- labelled oligonucleotide probe (labelled by polynucleotide 
kinase and 32p_ ATP ) . After overnight hybridization, the filters 
were washed with 2x SSC, 0.1% SDS at room temperature for 4-5 
hours. Finally, the filters were exposed to film overnight at 
-80°C using intensifying screens. 

Potential EPA clones were identified by aligning the 
duplicate filters and lookng for individual, colonies which 
hybridized to both independent probes (the 64 14-mers and 64 
12-mers). Ten such clones which appeared to hybridize to both 
probes were picked by aligning the x-ray film of the hybridized 
filter with the original master filter. These 10 colonies were 
picked and grown in liquid culture for the preparation of 
analytical amounts of DNA (prepared by the standard base 
treatment method). 

Step G. Southern ^nalvHin Of Th* Pnf ential rpr rlonM 

Samples of DNA from each of the 10 clones were cleaved with 
Pstl to excise the inserts. These digests wre analyzed by 
agarose gel electrophoresis. A variety of insert sizes were 
observed. The gel was next prepared for standard Southern 
analysis by incubating in base for 30 minutes, then in 1M Tris, 
1.5M NaCl for 30 minutes to neutralize the base. Nitrocellulose 
strips were placed oh either side of the gel. This sandwich was 
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placed on top of a 1 inch stack of paper towels and a second 
stack of towels was placed on top of the gel. dna was allowed to 
leach out of the gel overnight (in both directions) giving 
replica f liters of the gel. These filters were baked at 80 w c for 
2 hours 1q vacuo then probed overnight as described for the 
colony hybridization (1 filter with the 12 mers and 1 with 14 
mers) . After washing the filters and exposing them to film it 
was observed that the inserts from Beveral clones hybridized to 
both probes but three clones (designated 27, 57, 58) hybridized 
most strongly to the probes. These three clones all clearly had 
at least one internal Pstl site suggesting that they encoded the 
same sequence. To further test the clones, the Southern filters 
were washed to eliminate the radioactivity and were reprobed with 
the 2 pools of labeled 53-mers (hybridized as above except at 
37°C). Both sets of 53-mers hybridized strongly only with the 
three clones designated 27, 57, and 58 * again suggesting that the 
cdna inserts of these clones were good candidates for the gene 
encoding EPA. 

Step H. Sequencing The Candidate ep& pi ffnfn 

To determine the sequence of the clones in the region 
expected to encode the amino acid sequence determined from the 
purified protein we first determined which of the 8 pools of 
eight 14-mers best hybridized to the clone. To do this each pool 
was labelled separately with polynucleotide kinase and 32p- ATP 
and hybridized separately to a replica Southern filter prepared 
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as described above. Pool 14-5 proved to hybridize most strongly 
to clones 27, 57 and 58, indicating that this pool of 



the candidate clones. 

For sequencing. CsCl purified clone 58 DNA was denatured 
with base and sequenced using standard dideoxy sequencing with 
pool 14-5 as primer. This primer gave an unambiguous DNA 
sequence which clearly encoded residues 19 back to 1. in 
addition clone 58 contained 17 codons from a potential signal 
peptide. Prom this result, we concluded that we had identified a 
clone encoding the purified EPA protein. Since we were confident 
that clones 27 and 57 also encoded the EPA polypeptide, we 
determined the complete nucleotide sequence- of all three clones. 
This was accomplished by subcloning the Pstl. fragments of the 
clone (EPA tuned out to have 3 internal Pstl sites which generate 
four small fragments for sequencing) into M13 and sequencing them 
using the universal M13 primer. To align the sequences of the 
four Pstl fragments, olignonucleotide primers were synthesized 
for use in sequencing the plasmid DNA directly (as described 
above with pool 14-5). One of these primers; 

d (T-G-C-A-C-C-T-G-T-G-T-C-C-C-A-C-C-) ) 
encoded the end of the mature protein and could be used for 
insertion into the bacterial expression vector as well as 
sequencing at the 5" end of the coding region. A second primer; 

(d (G-A-G-G-A-G-T-T-T-C-T-C-A-T-T-G-C-T-G) ) 
encoded the region near the 3» end of the first Pstl fragment and 



oligonucleotides, would be a good primer for dideoxy sequencing of 
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could be used to sequence through the three Pstl sites towards 
the 3' end of the clone. The final primer; 

(d (C-C-A-C-A-A-G-C-A-A-T-G-A-G-G-T-G-C-C) ) 
was complementary to the mRNA strand near the 5» end of the 3' 
Pst fragment of the clone and could be used to sequence towards 
the 5' end of the clone through the 3 Pst I sites. The complete 
DNA sequence of clone 57 is illustrated in Pig. 4 along with the 
deduced amino acid sequence of the translated EPA protein product 
and the coding region was the same for the other two clones. 

step i. Expression Of epa Tn Mammalian Cella 

To prove that the protein purified and cloned as EPA really 
is epa it was necessary to express the cDNA* clone and demonstrate 
biological activity. Examination of the DNA sequence yielded no 
convenient restriction sites to excise the EPA coding region C3 
internal Pstl sites made this enzyme useless). To get around 
this problem, clone 57 DNA was cleaved with Bgl I which cleaves 
in pBR322 127 nucleotides away from the unique Pst I site. 

The Bgl I cut DNA was treated with the nuclease Bal 31 for a 
time sufficient to remove approximately 150 base pairs per end or 
300 total, from the DNA. The DNA was then cleaved with ivul which 
cuts pBR322 126 nucleotides on the other side of the Pstl site 
from the Bgl 1 site. This DNA was treated with the Klenow 
fragment of DNA polymerase 1 to render the ends blunt then 
ligated overnight with synthetic EcoRl linkers. The excess 
polymerized linkers were removed by digestion with EcoRl and the 
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approximately 800 base pair fragment was isolated from a 
preparative agarose gel. This DNA was recovered by the standard 
glass powder isolation method and was ligated to EcoRl linearized 
PBR322. Colonies containing EPA inserts were identified by 
colony hybridization using the labelled 14-mer (pool 14-5) as 
probe. Twelve colonies that scored as positive were picked, and 
their DNAs were prepared and analyzed by digestion with EcoRl. 
Three clones having inserts of the right size (about 800 base 
pairs) were sequenced using pool 14-5 as primer. 

One of these clones, in which the EcoRl linker had been 
inserted into the 5' non-coding region of EPA, was used to 
prepare the insert to be expressed in animal cells. This 800 
base pair EcoRl fragment was ligated with EcoRl digested 
transformation vector DNA. The ligation mix' was transformed into 
Ei coll and clones carrying the desired insert were identified by 
colony hybridization again, using labeled pool 14-5 DNA as probe. 
Twelve colonies which were identified as positive were used to 
prepare DNA. The orientation of their inserts was determined by 
digestion with Aval, an enzyme which cleaves near the 3' end of 
the EPA coding region. One clone, which was determined to have 
the coding region oriented correctly in the vector was used to 
prepare DNA which was then purified by equilibrium density 
centrifugation in CsCl. This purified DNA (lOug) was used to 
transfect monkey COS-7 cells. 
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ste P J - Construction of Ey r r»«^ 1on v » wt . nt . p o mifm 

The vector P AdD26SV P A(3) was described by (Kaufman et al., 
Mol. Cell Biol. 2(11), 1304-1319 119821. It haa the structure 
illustrated in Pig. i. Briefly this plasmid contains a mouse 
dihydrof olate reductase (DHPR) cDNA gene that is under 
transcriptional control of the adenovirus 2 (Ad2) major late 
promoter. A S' splice site is included in the adenovirus DNA and 
a 3' splice site, derived from an immunoglobulin gene, is present 
between the Ad2 major late promoter and the DHPR coding sequence. 
The SV40 early polyadenylation site is present downstream from 
the DHPR coding sequence. The procaryotic-derived section of 
pAdD26SV P A(3) is from pSVOd (Mellon, P., Parker, v., Gluzman, Y. 
and Haniatis, T. 1981, Cell 21x279-288) and does not contain the 
PBR322 sequences known to inhibit replication in mammalian cells 
(Lusky, M. , and Botchan, M. 1981, Nature (London) 221:79-81. 

pAdD26SVpA(3) was converted into plasmid pCVSVL2 as 
illustrated in Pig. 1. P AdD26SVpA(3) was converted into plasmid 
P AdD26SV P A(3) (d) by deletion of one of the two Pstl sites in 
pAdD26SVpA(3) . This was accomplished by a partial digestion with 
Pstl (using a deficiency of enzyme activity so that a 
subpopulation of linearized plasmids can be obtained in which 
only one Pstl site is cleaved) , then treatment with Klenow 
fragment of Pol I to blunt the ends, ligation to ^circularize 
the plasmid, transformation of S. coll, and screening for deletion 
of the Pstl site located V of the SV40 polyadenylation sequence. 
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The adenovirus tripartite leader and virus associated genes 
(VA genes) were inserted into pAdD26SVpA<3) (d) as illustrated in 
Pig. 1. First, pAdD26SvpA(3) (d) was cleaved with PvuII to make a 
linear molecule opened within the 3' portion of the first of the 
three elements comprising the tripartite leader. Then, pjaw 43 
(Zain et al. 1979, Cell 1£ 851) was digested with Xho I, treated 
with Klenow, digested with Pvull, and the 140 base pair fragment 
containing the second leader and part of the third leader was 
isolated by electrophoresis on an acrylamide gel (6% in Tris 
borate buffer; Maniatis et al. [1982] supra). The 140 bp 
fragment was then ligated to the Pvull digested pAdD26SVpA(3) (d) . 
The ligation product was used to transform E. colj , to 
tetracycline resistance and colonies were screened using the 
Grunstein-Hogness procedure using a 32p labelled probe 
hybridizing to the 140 base pair fragment. DNA was prepared from 
positively hybridizing colonies to test whether the Pvull site 
reconstructed was 5 'or 3' of the inserted 140 base pair DNA 
specific to the 2nd and 3rd adenovirus late leaders. In the 
correct orientation, the Pvull site is on the 5' side of the 140 
base pair insert. This plasmid is designated pTPL in Pig. 1. 

The Ava II 0 fragment of SV40 containing the SV40 enhancer 
sequence was obtained by digesting SV40 DNA with Ava II, blunting 
the ends with Klenow fragment of Pol I, ligating Xho I linkers to 
the fragments, digesting with Xho I to open the Xho I site, and 
isolating the fourth largest (D) fragment by gel electrophoresis. 
This fragment was then ligated to Xho I cut pTPL yielding the 
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plasmid pCVSVL2-TPL CPig. 2) . The orientation of the SV40 D 
Fragment in pCVSVL2-TPL was such that the SV40 late promoter was 
in the same orientation as the adenovirus major late promoter. 

To introduce the adenovirus virus associated (VA) genes into 
the PCVSVL2-TPL, first a plasmid was constructed that contained 
the adenovirus type 2 Hind III b fragment. Adenovirus type 2 DNA 
was digested with Hind ill and the B fragment was isolated after 
gel electrophoresis. This fragment was then inserted into pBR322 
which had previously been digested with Hind III. After 
transformation of E. coli to ampicillin resistance, the 
recombinants were screened for insertion of the Hind III B 
fragment and the inserted orientation was determined by 
restriction enzyme digestion. pBR322 - Ad Hind III b contains 
the adenovirus type 2 Hind in b fragment in the orientation 
depicted in Pig. 2. 

As illustrated in Pig. 2, the VA genes were conveniently 
obtained from plasmid P BR322-Ad Hind II by digesting with Hpa I, 
ligating EcoRl linkers and digesting with EcoRl, and recovering 
the 1.4kb fragment. This fragment having EcoRl sticky ends was 
then ligated into the EcoRl site of pC7SVL2-TPL (which had 
previously been digested with EcoRl) . After transformation of 
£011 HB101 and selection for tetracycline resistance, colonies 
were screened by filter hybridization to a DNA probe specific to 
the VA genes. DNA was prepared from positively hybridizing 
clones and characterized by restriction endonudease digestion. 
The product plasmid was designated p91023. 
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The 2 EcoRl sites in p91023 were removed. p91023 was cut to 
completion with EcoRl, generating two DNA fragments: one, about 
7Kb and, the other, a 1.4 Kb fragment containing the VA genes. 
The ends of both fragments are filled in using the Klenow 
fragment of Poll, and then both fragments, i.e. 1.3 Kb and 7Kb, 
were religated together. A plasmid p91023(A) containing the VA 
genes and similar to p91023 but deleted for the 2 EcoRl sites was 
identified by Grunstein-Hogness screening with the VA gene 
fragment, and by conventional restriction site analysis. 

Then the single Pstl site in p91023(A> was removed and 
replaced with an EcoRl site (Pig. 3). p91023(A) was cut to 
completion with Pstl, and then treated with Klenow fragment of 
Poll to generate flush ends. EcoRl linkers were ligated to the 
blunted Pstl site of p91023(A). The linear p91023(A), with EcoRl 
linkers attached at the blunted Pstl site, was separated from 
unligated linkers and digested to completion with EcoRl, and then 
religated. A plasmid p91023(B) was recovered and identified to 
have a structure similar to p91023(A), but with an EcoRl site 
situated at the previous Pstl site. 

EPA clone 57 prepared as described above (Step I) was 
digested with EcoRl to obtain the DNA coding for EPA. This DMA 
was ligated into the EcoRl site of p91023(B). Monkey COS-7 cells 
were transfected with the EPA containing plasmids and grown to 
express EPA. 

DNA transf ections were performed as described by Sompayrac & 
Danna, Pros, Natli Acad. ScJL t 78, pp. 7575-8 (1981) , with the 



WO 86/02100 W W PCT/US85/01900 

-51- 

addition of chloroquin treatment (Luthman and Magnusson, Nuc, 
Ac i ds Rea. 11, pp. 1295-1308 (1983)). Cells were rinsed with 
serum free media, and incubated at 37 degrees for 12 hours in 
Dulbecco's modified Eagle's medium containing DEAE Dextran (MW 
5,000,000 Pharmacia, 250 mg/ml) with .1M Tris pH 7.3 and 2 ug/ml 
of plasmid DNA. After incubation, the cells were rinsed with 
serum free media and treated 2 hours at 37 degrees with ImM 
chlorquin in media containing serum. Cells were subsequently fed 
with media and incubated for indicated periods of time. 

Conditioned medium from these cells greatly stimulated the 
growth of erythroid bursts and clusters in the standard EPA assay 
thereby demonstrating that the protein encoded by clone 57 is 
EPA. 

Step K. ExpresRion of epa m e. mn 

To express EPA in E. gflll it was necessary to exactly fuse 
the mature protein coding region to the ATG found in the 
expression plasmid pALP-181. Clone 57 containing the EPA gene 
has been deposited in vector pALP-1 81 transformed in coli as 
ATCC No. 39823. To prepare a fragment useful for this, the EcoRl 
insert generated in Step I was cloned into the EcoRl site of 
M13mp9. Single stranded DNA was isolated from this phage 
preparation and used as a template with the primer repair 
oligonucleotide as primer. When priming DNA synthesis with the 
Klenow fragment of DNA polymerase, this primer generated a 
complementary strand that begins exactly with T-G-C-A-C-C-.. . the 
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sequence of the amino terminus of the mature protein, single 
stranded regions of this molecule were removed by treatment with 
SI nuclease and the coding region of EPA was excised by digestion 
with BaraHl. The pALP-181 vector fragment was prepared by Kpnl 
digestion which opens the plasmid immediately adjacent to the ATG 
to be used in expressing proteins. By treating with Klenow, the 
Kpnl sticky ends are removed leaving a flush ATG in the desired 
location. Cleavage of the vector with BaraHl renders the vector 
capable of ligating with the flush to BamHl fragment carrying the 
mature coding region of EPA. These two DNAs were ligated 
together and used to transform E. con W3110 lambda Y13 9, a 
strain carrying a temperature sensitive lambda repressor protein 
(plating is done at 30° to keep the P L promoter of pALP-181 
turned off). .Colonies from the transformation were lifted onto 
filters and two replica filters were prepared for colony 
hybridization. One replica was hybridized to the primer repair 
oligonucleotide and the second filter was hybridized to a 
junction oligonucleotide having the sequence 

d(T-A-C-A-T-A-T-G-T-G-C-A-C-C-T) which spans the desired correct 
sequence of the mature EPA sequence fused to the ATG of the 
expression plasmid. Pour colonies which hybridized to both 
probes were grown and checked for the ability to produce EPA. To 
do this, 50 ml cultures were grown for several generations at 
30°. To inactivate the repressor protein the temperature was 
shifted to 38° and incubation continued for two hours. The cells 
were harvested by centrif ugation and lysed by sonication. The 
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extract was diluted in laerali SDS sample buffer and analyzed by 
SDS polyacrylamide gel electrophoresis. That the EPA polypeptide 
had been expressed was indicated by the appearance of a 28,000 
dalton polypeptide in all four samples that was not seen in 
strains not carrying the EPA insert. That this protein was 
active was be demonstrated by assaying highly diluted samples of 
the extract. The EPA protein can be purified to substantial 
homogeneity by the procedure described in Step C above. 

EXAMPT.E 9 

Step A: mRNA Preparation Prr^ T h«> r.ihh nn T-fAn a 

A sample of the gibbon T-cell line designated UCD-MLA 144 
was cultured for several weeks in RPMI 1640. (purchased from 
Gibco) and 20% fetal calf serum (PCS) until there was obtained 1 
x 109 total cells. The cells were induced to produce high levels 
of EPA by activation for 24 hours in the presence of 10 nanograms 
per ml 12-0-tetradecanpylphorbol- 

13-acetate (TPA) in RPMI 1640 +1% PCS. The cells were harvested 
by centrifugation (1000 rpm., 5 min.) , washed once with phosphate 
buffered saline (PBS) and finally collected by centrifugation. 

Membrane bound polysome (MBP) mRNA was prepared from these 
cells using the same procedure as described in Mechler, b. and 
Rabbitts/ T.H. (1981), J. Cell Biol. 88:29. 



Step B. First Strand cDNA Pp a rH. n 

6 ug of MBP mRNA (from Step A) was diluted into a 50 ul cDNA 
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synthesis reaction mixture in accord with standard methods 
(Haniatis et al., supra) and the reaction initiated by the 
addition of reverse transcriptase. After incubation for 30 
minutes at_42°C, the reaction was stopped by addition of EDTA to 
50 mM, and diluted with H 2 0 to 100 ul. The mixture was extracted 
with phenol/chloroform and further extracted with chloroform. 
The cDNA/RNA hybrids were separated from unincorporated 
triphosphates by chromatography on a 2 ml Sepharose CL-4B column,, 
The excluded fractions were pooled and the hybrids collected by 
ethanol precipitation. The final yield was 570 ng. 

Step C. Second Strand cDNA Reaction 

The first strand cDNA pellet (from Step B) was resuspended 
in 50 ml of H2O, and second strand synthesis carried out in a 
standard reaction mixture with e. coli polymerase I, e. eoli 
ligase, and RNAse H. The reaction was incubated overnight at 
16°C and then incubated for 1 hour at 37°C. The reaction was 
stopped by addition of EDTA and extracted with phenol/chloroform. 
The cDNA was separated from unincorporated triphosphates by 
chromatography on a Sepharose CL-4B column, the excluded 
fractions pooled and the cDNA collected by ethanol precipitation. 

Step D. Recombinant cmiA Preparation 

The cDNA pellet (from Step C) was resuspended in 75 ul of 
H 2 0. Homopolymeric C "tails" were added to the ends of the cDNA 
by adding 10 ul of the cDNA solution to a 25 ul standard reaction 
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mixture with terminal transferase, and incubating at 30°c for 5 
minutes. The reaction was stopped by the addition of EDTA to 40 
mM and and heat inactivation at 68°c for 10 minutes. 10 ng of 
this tailed cDNA was annealed with 50 ng of G-tailed pBR322 
(purchased from NEN) in 10 ul of 10 mM Tris, pH 7.5, 1 mM EDTA, 
and 100 mM NaCl. The annealing reaction was incubated for 10 
minutes at 68»C and then for 2 hours at 57 •. 

Step E. Bacterial Transf ormaH^ 

E« CPU strain MC1061, was grown in L-b roth, chilled on ice, 
harvested by centrif ugation, and treated wth CaCl 2 to prepare 
them for transformation. 5 ul of the cDNA annealing reaction "was 
then incubated with 200 ul of the CaCl2-treated bacteria. 15 
such transformations were performed, using all of the annealed 
cDNA, and spread on 15 cm 1% agar L-broth plates containing 10 
ug/ml tetracycline. Approximately 1000 colonies grew on each 
plate. 

Step P. Reoliga PlaHn^ 

10,000 colonies from the transformation were each picked 
with a toothpick, transferred to fresh plates (500 per plate in a 
grid), and grown overnight at 37°C. The colonies were then 
lifted from each plate by pressing a dry nitrocellulose filter 
firmly over the surface of the plate. Two replica filters were 
prepared from each of these master filters. The master filters 
were stored at 4°C, and the replica filters treated with base, 
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and baked to prepare them for hybridization. 
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Step G. Preparation Of 32p Label IpH Hybridisat ion PrnhP B 

The cDNA insert from EPA Clone 57 (Example 1) was isolated 
by digestion with the restriction enzyme ecori . and 
electrophoresis in an agarose gel with Tris acetate and ethidium 
bromide. The band containing the cDNA fragment was cut from the 
gel and purified by the glass powder technique. 

300 ng of the cDNA fragment was then added to 1 ul of 10 x 
T4 DNA Polymerase Buffer (0.33 M Tris Acetate, pH 7.9, 0.66 M 
potassium acetate, 0.1 M Magnesium acetate and 10 mH 
dithiothreitol) , and 3 units of T4 DNA Polymerase (New England 
Biolabs), and diluted with water to 10 ul. • After incubation for- 
5-10 minutes at 37°C, this mixture was combined with 1 ul 10 x T4 
DNA Polymerase Buffer; 1 ul of a 2 mH solution of each of dCTP, 
dTTP, dGTP; 10 ul of 32 Po ; AT p (10 uCi/ul, 3,000 Ci/mmole) ; and 3 
units of T4 DNA Polymerase. The reaction was incubated for 20 
minutes at 37°C. Then 1 ul of 2 mM dATP was added and the 
reaction incubated for an additional 10 minutes at 37 °C. 

The unincorporated triphosphates were separated from the 
labelled cDNA by chromatography on a Sephadex G100 column. A 
second probe was prepared from a synthetic oligonucleotide having 
the sequence! 

ATG GCC CCC TTT G 
which is complimentary to the amino terminus of the EPA coding 
region. This oligonucleotide was labelled with 32 P dATP at its 
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5* end using a standard polynucleotide kinase reaction. 

Step H. Isolation of Gibbon EPA cDNA Clones 

In a standard hybridization screening procedure, some 45 
clones hybridized with the T4 labelled EPA Clone 57 cDNA. Of 
these/ approximately -12 also hybridized to the labelled 
oligonucleotide probe. Three of these clones were sequenced 
in accordance with conventional procedures (as -described for 
example in Nature, Vol. 35/ 27 June 1985:768-771) and each 
produced the EPA nucleotide gene sequence and deduced EPA 
amino acid protein sequence shown in Pig. 5. 

The invention has been described in detail, including the 
preferred embodiments thereof. However, it will be appreciated 
that those skilled in the art/ upon consideration of this 
specification, may make modifications and improvements within 
the spirit and scope of the invention. 
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What is claimed: 

1. A method for preparing an expression vector containing 
an EPA gene, said method comprising: 

inducing a T-lyrophocyte cell to enhance lymphokine 
production; 

preparing cytoplasmic RNA from said induced 

» 

T-lymphocyte; 

preparing polyadenylated messenger RNA from said 
cytoplasmic RNA; 

preparing single stranded cDNA from said messenger RNA; 

hybridizing said single stranded cDNA with a preparation 
from B-cell lines or immature T-cell lines to form double 
stranded molecules; 

separating the double stranded molecules from the 
remaining single stranded cDNA; 

converting the single stranded cDNA to double stranded 

cDNA; 

cloning the double stranded cDNA; 

hybridizing the clones with a pool of oligonucleotide 

probes ; 

selecting the clone that hybridizes most strongly with 
the probes; 

sequencing the clone that hybridizes most strongly to 
determine the oligonucleotide sequence of the EPA gene; and 
inserting the EPA gene into an expression vector. 
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2. Human EPA protein substantially free of other 
protein of human origin, 

3. Gibbon EPA protein substantially free of other 
protein of gibbon origin. 

4. EPA protein substantially free of glycosyla- 

tion. 

5. EPA protein glycosylated by expression of a 
recombinant expression vector containing an EPA gene in 
a transformed eukaryotic cell. 

6. cDNA that codes EPA. 

7. An expression vector comprising cDNA coding 
for EPA. 

8. A transformed cell comprising the expression 
vector of claim 7 or an allelic variation thereof. 

9. A therapeutic composition for the treatment 

of mammals to stimulate growth and formation of erythroid 
cells comprising erythroid cell growth and formation 
stimulating treatment amount of EPA protein in a phar- 
macological carrier. 

10. A method for the treatment of mammals having 
low red blood counts/ said method comprising treating 
said mammal with EPA protein. 
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10 20 30 40 

CCCGCAGATC CAGCGCCCAG AGAGACACCA GAGAACCCAC C ATG GCC CCC TTT 

MET Ala Pro Phe 

56 71 86 

GAG CCC CTG GCT TCT GGC ATC CTG TTG TTG CTG TGG CTG ATA GCC 
Glu Pro Leu Ala Ser Gly lie Leu Leu Leu Leu Trp Leu lie Ala 
101 H6 |3| 

CCC AGC AGG GCC TGC ACG TGT GTC CCA CCC CAC CCA CAG ACG GCC 
Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gin Thr Ala 
146 161 176 

' TTC TGC AAT TCC GAC CTC GTC ATC AGG GCC A AG TTC GTG GGG ACA 
Phe Cys Asn Ser Asp Leu Val lie Arg A I a Lys Phe Vo I Gly Thr 

191 206 221 

CCA GAA GTC AAC CAG ACC ACC TTA TAC CAG CGT TAT GAG ATC AAG 

Pro Glu Val Asn G I n Th r Thr Leu Ty r G I n Arg Tyr Gl u I le Lys 

236 251 266 

ATG ACC AAG ATG TAT AAA GGG TTC CAA GCC TTA GGG GAT GCC GCT 

MET Thr Lys MET Tyr Lys Gly Phe Gl n Al a Leu Gly Asp Ala Al a 

281 296 31 1 

GAC ATC CGG TTC GTC TAC ACC CCC GCC ATG GAG AGT GTC TGC GGA 

Asp He Arg Phe Vol Tyr Thr Pro Ala MET Glu Ser Vol Cys Gly 

326 341 356 

TAC TTC CAC AGG TCC CAC AAC CGG AGC GAG GAG TTT CTC ATT GCT 

Tyr Phe His Arg Ser His Asn Arg Ser Glu Glu Phe Leu He Al a 

371 386 401 

GGA AAA CTG CAG GAT GGA CTC TTG CAC ATC ACT ACC TGC AGT TTT 

Gly Lys Leu Gin Asp Gly Leu Leu His lie Thr Thr Cys Ser Phe 

416 431 446 

GTG GCT CCC TGG AAC AGC CTG AGC TTA GCT CAG CGC CGG GGC TTC 

Vol Al a Pro Trp Asn Ser Leu Ser Le u Al a Gin Arg Arg Gly Phe 

461 476 491 

ACC AAG ACC TAC ACT GTT GGC TGT GAG GAA TGC ACA GTG TTT CCC 

Thr Lys Thr Tyr Thr Val Gly Cys Glu Glu Csy Thr Val Phe Pro 

506 521 536 

TGT TTA TCC ATC CCC TGC AAA CTG CAG AGT GGC ACT CAT TGC TTG 

Cys Leu Ser I le Pro Cys Lys Leu Gin Ser Gly Thr His Cys Leu 

551 566 581 

TGG ACG GAC CAG CTC CTC CAA GGC TCT GAA AAG GGC TTC CAG TCC 

Tr p Thr Asp Gin Leu Leu Gin Gly Ser Glu Lys Gl y Phe Glu Ser 

596 61 I 626 

CGT CAC CTT GCC TGC CTG CCT CGG GAG CCA GGG CTG TGC ACC TGG 

Arg His Leu Ala Cys Leu Pro Arg Glu Pro Gly Leu Cys Thr Tr p 

641 656 675 685 

CAG TCC CTG CGG TCC CAG ATA GCC TGA ATCCTGCCCG GAGTGGAAGC 
Gin Se r Leu Arg Ser Gin He Ala 

695 705 715 725 735 

TGAAGCCTGC ACA GTG TCC A CCCTGTTCCC ACTCCCATCT TTCTTCCGGA 

745 755 765 

CAATGAAATA AAGAGTTACA CCAGCAAAAA AAAAA 

FIG. 4 
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6 16 42 

GCCCAG AGAGACACCA GAGAACCCAC C ATG GCC CCC TTT GAG CCC 

MET Al a Pro Phe Glu Pro 

FI G S? T I CT £? C fT C , CTG 7 TG TTG CTG TGG CTG A "TA GCC CCC AGC 
Leu Alo Ser Gly He Leu Leu Leu Leu Trp Leu He Ala Pro Ser 

AGG GCC TGC ACC TGT.GTC CCA CCC CAC CCA CAG ACG GCC TTC TGC 
Arg Ala Cys Thr Cys Va I Pro Pro His Pro Gl h Thr Ala Phe Cys 

150 

AAT TCC GAC CTC GTC ATC A AG GCC AAG TTT GTG GGG ACA CCA GAA 
Asn Ser Asp Leu Vol I le Lys Ala Lys Phe Val Gly Thr Pro Glu 

SI? a AC S AG $P $S C 7 TA I AC £ AG CGT TAT gag ATC AAG ATG ACC 
Val Asn Gin Thr Thr Leu Tyr Gin Arg Tyr Gl u I le Lys MET Thr 

240 

AAG ATG TAC AAA GGG TTC CAA GCC TTA GGG GAT GCC GCT GAC ATC 
Lys MET Tyr Lys Gly Phe Gin Alo Leu Gly Asp Ala Ala Asp He 

300 

CGG TTC GTC TAC ACC CCC GCC ATG GAG AGT GTC TGC GGA TAC TTC 
Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Leu Vol Gly Tyr. Phe 

CAC AGG TCC CAC AAC CGC AGC GAG GAG TTT CTC ATT GCT GGA AAA 
His Arg Ser His Asn Arg Ser Glu Glu Phe Leu He Ala Gly Lys 

CTG CAG GAC GGA CTC TTG CAC ATC ACC ACC TGC AGT TTC GTG GCT 
Leu Gin Asp Gl y Leu Leu His He Thr Thr Gys Ser Phe Val Al a 

420 450 
CCC TGG AAC AGT CTG AGC TTA GCT CAG CGC CGG GGC TTC ACC AAG 
Pro Trp Asn Ser, Leu Ser Leu Al o Gl n Arg Arg Gly Phe Thr Lys 

ACC TAC ACT GTT GGC TGT GAG GAA TGC ACA GTG TTT CCC TGT TTA 
Thr Tyr Thr Val Gly. Cys Glu Glu Csy Thr Vol Phe Pro Cys Leu 

510 

TCC ATC CCC TGC AAA CTG CAG AGT GGC ACT CAT TGC TTG TGG ACG 
Ser He Pro Cys Lys Leu Gin Ser Gly Thr His Cys Leu Trp Thr 

570 

GAC CAG CTC CTC CAA GGC TCT GAA AAG GGC TTC CAG TCC CGT CAC 
Asp Gin Leu Leu Gin Gly Ser Glu Lys Gly Phe Gin Ser Arg His 

630 

CTT GCC TGC CTG CCT CGG GAG CCA GGG CTG TGC ACC TGG CAG TCC 
Leu Ala Cys Leu Pro Arg Glu Pro Gly Leu Cys Thr Trp Gin Ser 

CTG CGG TCC CAG ATA GCC TGA ATCCTGCCCG GAGTGGAAGC 
Leu Arg Ser Gl n He Ala 

TGAAGCCTGC ACAGTGTCCA CCCTGTTCCC ACTCCCATCT TTCCTCTGGA 

731 741 
CAATGAAATA AAGAGTTACC ACCCAGCG 

FIG. 5 
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